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ABSTRACT 

The popular halo model predicts that the power spectrum of the galaxy fluctuations 
\^ ' is simply the sum of the large scale linear halo-halo power spectrum and the weighted 

power spectrum of the halo profile. Previous studies have derived halo parameters 
' from the observed galaxy correlation function. Here we test the halo model directly 

^ ' for self-consistency with a minimal set of theoretical assumptions by utilising the 2dF 

^ : Galaxy Redshift Survey (2dFGRS). We derive empirically the halo occupation and 

" ' ' galaxy radial distributions in the haloes of the 2dF Percolation-Inferred Galaxy Group 

J£ -J \ (2PIGG) catalogue. The mean halo occupation number is found to be well-fitted by a 

^ . power-law, {N\M) oc M^, at high masses, with (3 = 1.05,0.88,0.99 for red, blue and 

' all galaxies respectively (with 1-sigma errors of 15-19%). We find that the truncated 

, NEW profile provides a good fit to the galaxy radial distributions, with concentration 

parameters c = 3.9,1.3,2.4 for red, blue and all galaxies respectively (with 1-sigma 
errors of 8-15%). Adding the observed linear power spectrum to these results, we 
compare these empirical predictions of the halo model with the observed correlation 



Q ■ functions for these same 2dF galaxy populations. We conclude that subject to some 

^ \ fine tuning it is an acceptable model for the two-point correlations. Our analysis also 

C/3 i explains why the correlation function slope of the red galaxies is steeper than that of 

' the blue galaxies. It is mainly due to the number of red and blue galaxies per halo, 

rather than the radial distribution within the haloes of the two galaxy species. 

•i-H . 

^ . Key words: large-scale structure of Universe - galaxies: haloes - galaxies: statistics 

' ~ dark matter 



1 INTRODUCTION 

It is well known that galaxies of different types are not iden- 
tically distributed within galaxy groups (manifested in the 
well-known phenomenon of morphological segregation, e.i 
iDressle j IToSol: iPostman fc Gelled Il984 iGoto et alJ 12003 
Red, typically elliptical galaxies, with low star-formation 
rates are preferentially found towards the centres of large 
groups, while blue, actively star-forming galaxies dominate 
in the outskirts of groups and in the field. Furthermore, mea- 
surements of the galaxy-galaxy two-point correlation func- 
tion have shown that it is well fitted by a power- la w over a 
wide range of distance scal es (e.g. lPeebles 198Q. : .Zehavi et alJ 
l2002l:lHawkins et al .120031) . and that the slope of this power- 
law is a function of galaxy spectral type or colour (e.g. 
I Zehavi et all l2002l : iNorberg et"al] l2002al: iMadewick et alJ 
l2003f) : blue galaxies have a shallower two-point correlation 
function than the red galaxies. The correlation function of 
the dark matter, on the other hand, has a lower amplitude 
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and is far from being a featureless power-law, thus the galaxy 
distribution is said to be biased with respect to the dark 
matter. 

Much recent progress towards understanding the nature 
of galaxy biasing has come through use of the halo model of 
la rge scale structure. The model has its origins in the work 
of lNevman fc Scotd lll952l1. and was first applied to c ont inu- 
ous density fields by Scherrer fc Bertschinged dlOQJl . It has 



dark matter and galaxies (e.g. 'Peacock & Smith*2Q0Cf; 


'Sehari 


2000; 


Scoccimarro ct al. 2001; Bcrlind & Weinberg 


20q1 


Maghocchetti & Porciani 20031 Ivan den Bosch. Yang & Md 


2ooai. 


.Coorav & Sheth, (.2002f) nrovide a detailed review. In 



the model, matter in the universe is assumed to reside only 
in discrete haloes. The distribution of the matter within the 
haloes gives rise to the non-linear component of the power 
spectrum, while the large-scale distribution of the haloes 
in space is responsible for the power on linear scales. The 
simplicity of the model allows analytic calculations of corre- 
lation functions to be made, so that large parameter spaces 
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can be investigated at relatively little cost when compared 
to numerical simulations of the large scale structure. 

Two ingredients are required to extend the model to 
galaxy clustering: the probability distribution for the num- 
ber of galaxies hosted by a particular halo, P{N\M), known 
as the halo occupation distribution (HOD), and the spatial 
distribution of galaxies within haloes. The distribution of 
the galaxies within haloes is commonly assumed, ad hoc, to 
be the same as that of the dark matter; differences in the 
clustering properties are then solely due to the choice of halo 
occupation distribution. We do not rely on this assumption 
in our analysis below, but instead obtain the actual galaxy 
group radial density distributions observationally. This al- 
lows us to con firm, for observed H ODs and radial profiles, 
the findings of ISheth et alJ i200ll) . who used HODs drawn 
from semi-analytic models to demonstrate the dominance of 
the HOD over the intra-halo distribution in the correlation 
function. 

Previous studies have assumed the validity of the halo 
model, and used the observed galaxy correlation functions to 
constrain the halo occupation distribution, or the radial dis- 
tribu t ion of galaxies in haloes (e . g. Magliocch ctti fc Porciani 
l2003l: IScrantonI lioO^: Ivan den I3osch et alll20o3) . In con- 
trast, we take as empirical an approach as possible. We 
directly measure the radial and halo occupation distri- 
butions of galaxies in the 2dFGRS Percolation-Inferred 
Galaxy Group (2PIGG) catalogue ( Eke ct al. 2004a), fo- 
cusing in particular on the separate distributions of red 
and blue galaxies. Having measured these distributions, no 
free parameters remain in the model. Since the correla- 
tion functions of these popula tions have p reviously been di- 
rectly obtained I Nor berg et al. 2002a: .Hawkins et al.ll2003l : 

iMadgwick et al.ll2003^ . the non-trivial success of the halo 
model in reproducing the observed clustering statistics can 
be directly tested using fully self-consistent 2dF observa- 
tions. Requiring the model to account for the differences in 
the observed correlation functions of the red and blue pop- 
ulations is an especially stringent test. 

The structure of this paper is as follows. In Section|21we 
describe the halo model in detail. We introduce the 2PIGG 
catalogue in Section|3]and describe how the properties of in- 
terest are inferred from the observational data. In Section 2] 
the halo occupation distribution is investigated, and in Sec- 
tion |S] we examine the radial distribution of galaxies within 
the 2PIGGs. Finally, in SectionlH] the halo model is applied 
to our results, and the predicted clustering results compared 
with observations. 



2 THE HALO MODEL FORMALISM 

In essence, the halo model exemplifies the natural distinc- 
tion between linear (large-scale) and non-linear (small-scale) 
clustering. Indeed, the two regimes appear as separate terms 
in the halo model galaxy power spectrum: 



(1) 



with PgH^'' (k) the intra-haio, non- linear term, and Pg'^^\k) 
the inter-halo, linear term. Although we will later require 
the two-point correlation functions, ^{r), explicit calculation 
of these involves convolutions of the halo profiles; we prefer 



instead to work in Fourier space, ultimately obtaining the 
correlation function via the transform 



Cgg(0 = 



.2 /, N sin(fer) dfc 



(2) 



with Agj^i(fc) — ^^Pgai(A;), the dimensionless form of the 
power spectrum. 

Within the halo model framew ork, the components o f 
the galaxy power spectrum are (e.g. ICoorav fc Shethll2002^ 



D{lh) 



and 
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where the integrals are over the halo mass, M. In these ex- 
pressions, n{M) is the halo mass function, b{M) is the halo 
biasing factor, {N\M) and {N{N - 1)\M) are the first and 
second factorial moments of the halo occupation distribu- 
tion, P{N\M), respectively, figai is the average number den- 
sity of galaxies, and iigai(fc|M) is the Fourier transform of 
the normalised radial distribution of galaxies within haloes. 
The following sections describe these terms in more detail. 

In keeping with the ethos of this work, we opt to use 
the simplest possible implementation of the halo model. We 
note for completeness that the implementation may be mod- 
ified to account for the expectation that the first galaxy 
in each halo resides at the halo centre of mass (see e.g. 
ICoorav fc Shethll2002l : iKravtsov et"al]l2004ll . 

2.1 Halo mass function 

The number density of haloes of mass Al in space is de- 
scribed by the mass funct ion, n(A /), which we assume to 
have the form proposed b vlSheth fc To rmcn ( 19991) (an ex- 
tension of the lPress fc Schechteirill974) form): 



n(M)dM = f^f{u)du, 



A (! + (,.)-) (g) 



1/2 



-giy/2 



(5) 



(6) 



where p is the background density of the universe, q = 0.707, 
p — 0.3, and normalisation implies At « 0.3222. The mass 
variable is defined asiy= {Ssc{z)/(j{M))'^ , where Ssc{z) is the 
linear-theory prediction for the present day overdensity of a 
region undergoing spherical collapse at redshift z, and a{M) 
is the r.m.s. variance of the present day linear power spec- 
trum in a spherical top-hat which contains an average mass 
M. Note that the mass function depends on the redshift only 
through Ssc{z) = 5sc(0)/D(z), where 4c(0) ^ 1.68, and D{z) 
is the linear growth factor, normalized so that D{0) = 1. The 
value o f ^sc(O) is only weakly se nsitive to the cosmological 
model jEke. Gole fc Frenklll99d) . 



2.2 Halo biasing 

Haloes are biased tracers of the overall dark matter distri- 
bution. T he degree of bias is a function of the halo mass. 
Following IMo fc White! (|l996i) , we can write the power spec- 
trum of dark matter haloes of given masses. Mi and M2 as 
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Phh(fe;Mi,M2) = b{M^) h{M2) Pdm(fc). 



(7) 



We further assume Pdm(fc) = Pd^\k), the hnear dark mat- 
ter power spectrum, since inter-halo correlations are ouIy im - 
portant on large, quasi-linear scales. lsheth fc TormenI (Il999l) 
derive the required halo bias factors. 



b{M) = 1 + 



qv — 1 2p/5sc{z) 



(8) 



5sc{z) l + {qv)P 
with p and q taking the values given in Section [2. II 

2.3 Galaxy distribution 

The halo occupation distribution, P{N\M), appears in the 
power spectrum (equations and through its first and 
second factorial moments, {N\M) and {N{N — 1)|A/) re- 
spectively. The galaxy number density is given by 



rigai = / {N\M) n{M) <1M. 



(9) 



The spatial distribution of galaxies within haloes is as- 
sumed to be spherically symmetric about the halo centre, so 
that the density profile, pgai(T-|M), is a function of r only for 
a halo of a given mass M. The profile is normalized so that 
J /5gai(rjAf) 47tr^dr = 1 (rvir will be defined in equation 
Hat , and the Fourier transform of the normalized profile is 
denoted by Ugai(fc|M). 

We attempt to directly measure the radial density and 
halo occupation distributions in subsequent sections. 



2.4 Galaxy bias 

On distance scales for which the inter-halo term is impor- 
tant, iigai(fcjA/) ~ 1 is a good approximation. The integral 
on the right-hand side of equation |1] is then independent of 
scale and we can re-write the relations as: 

P^^\k)=bl^iP!^^\k), (10) 

where we have defined the galaxy bias parameter as: 



fegai = / dM n(M) fo(M)-^^'^^ 



Mgal 



(11) 



2.5 An empirical approach to the halo model 

As we have stressed, our aim is to rely on observations wher- 
ever possible, avoiding model-dependent assumptions. Our 
underlying assumption is that the observed galaxy groups 
represent the haloes. Even if this assumption is not perfect 
it is very likely that there is a simple ranking relation be- 
tween the observed galaxy groups and the underlying dark 
matter haloes. Although we could take for n{M) the his- 
togram of the observed groups versus their estimated mass, 
we prefer to use the more robust mass function n(M) given 
by equation |S] Future group samples with accurate masses 
will allow us to use them directly for n{M). 

Equation |3| requires knowledge of the occupation quan- 
tities {N\M), {N{N - 1)|M) and the radial profile /9gai(r) 
which we shall determine directly from the 2PIGG sample 
per galaxy type. The mean number of galaxies (equation|^ 
then follows from the above n{M) and {N\AI). 

The halo-halo correlation power spectrum P^'^]^\k) 



could in principle be taken directly from the group-group 
power spectrum. The group-gr oup correlation fun ctions 
have actu ally been derived by iPadilla et alj i2004ri and 
lYang et"al.. (2005b ) . Ifowever, biases in mapping the groups 
to haloes may make this approach somewhat inaccurate with 
the present data. While this should be possible with fu- 
ture group samples here we follow the approximation given 
by equations 1101 and 1111 In fact, as shown later in Sec- 
tion It). II we find that fegai is close to unity, interestingly 
in accord with the biasing de rived from the 2dF linear 
galaxy-galaxy power spectrum jPercival et al. |200l|) com- 
bined with pre-WMAP CMB measurements Lahav et alJ 
l2002h . In practice we use equations 1101 and 1111 with the lin- 
ear power spectrum of the dark matter, P^]^\k), which has 
been well constrained by observations of the Cosmic Mi- 
crowave Background (CMB). The galaxy power spectrum 
on linear sc ales has also be en measured observationally, for 
example by IPercival et alJ JloOl) from the 2dFGRS, and 
iTegmark et all J2002l) from the Sloan Digital Sky Survey. 
The shapes of these power spectra have been shown to be 
consistent with a flat universe A-CDM matter power spec- 
trum, with present ep och ^ 0.3 ( ^ Percival et aL .20011 : 
lEfstathiou et al.ll200a ICole et alJl2005^ . We therefore as- 
sume this form for Pi!i''''(fc), a dopting the WMAP normali- 
sation as = 0.9 llSpergel et al. 2003) and the biasing param- 
eter derived in Section f6. II 

We emphasize again that although we have to com- 
promise here by making several theoretical assumptions, it 
would be possible in the future to use galaxy and group cata- 
logues to test the halo model almost without any theoretical 
prior. 



3 GALAXY GROUP CATALOGUE 

We assume that galaxy groups are representative of the 
underlying dark matter haloes. The 2dF GRS Percolation- 
Inferred Galaxy Group catalogue ('2PIGG: lEke et al.l20043) 
is currently the largest homogeneous sample of galaxy 
groups publicly available. It comprises ~29,000 groups con- 
taining at least two galaxy members, which host a total of 
~105,000 galaxies. As described in the following subsections, 
we apply a number of cuts to the catalogue in order to im- 
prove the quality of the sample used in our analyses (our 
final sample comprises ~ 3000 groups). 

An independent attempt at constructing a g alaxy group 
catalogue for the 2dFGRS has been made by lYang et alJ 
f 2005a), using a halo model-based algorithm. Although it is 
not the aim of this work to perform a comparative study 
of the two catalogues we note relevant differences where ap- 
propriate. 

The 2PIGG groups were identified from the 2dFGRS by 
means of a friond s-of-friends (EOF) percolation algorithm. 
lEke et aO (l2004af) tested the algorithm on mock samples 
generated from cosmological dark matter simulations in or- 
der to optimize the mapping between the recovered galaxy 
groups and the actual dark matter haloes; this strengthens 
the case for our prior assumption that galaxy groups may be 
identified with the true bound structures in the dark matter 
dist ribution. 

lEke et alj ^2004a^ tuned their group-finding algorithm 
so as to maximize the completeness of the recovered groups. 
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Consequently, very few true group members are erroneously 
excluded, but this is at the expense of increased contamina- 
tion by interlopers (i.e. field galaxies assigned to groups, or 
false groups comprised entirely of field galaxies). The sever- 
ity of the contamination increases with redshift; following 
lEke et al.l ^0^, we discard groups at redshifts greater 
than z = 0.12. At this redshift the total number of field 
galaxies included in groups rises to ~ 50 per cent of the total 
number of true group members, and the number of interlop- 
ers increases very rapidly with redshift beyond this point. 
This cut leaves ~16,000 re mainin g groups. We note that the 
halo-based group finder of lYang e t al. ( 20053) achieves sim- 
ilar completeness to standard FOF but reduces the contam- 
ination level, typically by a factor of two (for group masses 
M > 10^''/i"^Mq). 

3.1 Spectral classification 

iMadgwick et al.l ll2002f) used a principal component anal- 
ysis of the 2dFGRS dataset to define, rj, a continuous 
parametrization of spectral type. This parameter is most 
strongly correlated with the current star formation rate 
in each galaxy, but is also a goo d indicator for morpho - 
logical type and colour. Following IMadgwick et al] l)2003fl . 
we broadly classify galaxies in our sample using a cut at 
rj — —1.4. We label galaxies with rj > —1.4 (relatively ac- 
tive) as blue galaxies, and those with ri < —1.4 (relatively 
passive) as red. We exclude from our analysis any group 
which does not have a measurement of rj for all its member 
galaxies. This reduces the sample to ~14,000 groups. 

3.2 Mass estimation 

The majority of the 2PIGGs have a measurement for the 
one-dimensional velocity dispersion, CTv This can be used to 
estimate the group mass as 



M = A 



(Tv-Rrr 

g" 



(12) 



where -Rrms is the r.m.s. projected separation from the cen- 
tral galaxy of the remaining galaxies assi gned to th e group . 
Through use of simulated mock surveys, lEke et al.l ll2004al) 
derive the value A = 5.0 by requiring that the estimated 
mass be unbiased with respect to that of the underlying 
dark matter haloes. The simulated dark matter haloes are 
identified using a friends-of-friends algorithm, and the re- 
covered haloes have mean spherical overdensities of ~ 200 
times the background d ensity (Eke, private communication) . 
ISheth fc TormenI lll999l) define the mass of a halo to be that 
enclosed within such an overdensity, thus it is valid to iden- 
tify the mass estimate of equation 1121 with the halo mass 
used throughout Section |5| 

The measurement of the velocity dispersion is very un- 
reliable for groups with a small number of observed galax- 
ies, and even for groups with large memberships we must be 
wary of the significant scatter in the relation bet ween group 
mass and the velocity dispersion (see fig. 3 of lEke et all 
[2004b). Yang et aL ( 2005a) have independently constructed 
a galaxy group catalogue for the 2dFGRS. They investigate 
the reliability of dynamical mass estimates, and propose an 
alternative halo mass assignment based on total group lumi- 
nosity. This gives them a particular advantage at low group 



masses where the dynamical masses are especially unreli- 
able. 

To avoid the poorest mass estimates we retain only 
those groups with at least four observed galaxy members. 
The final sample contains 3,147 groups hosting a total of 
25,118 galaxies (of which 12,851 are blue and 12,267 are 
red). 

In order to maintain internal consistency regarding the 
definition of the group mass, we define the group virial ra- 
dius. 



\^47tAp j 



(13) 



to be that enclosing a spherical overdensity A = 200 times 
the background density of the universe (at the group red- 
shift). 



3.3 Group membership 

In order to estimate the true galaxy membership of the 
2PIGGs it is necessary to correct for (i) the flux limit, and 
(ii) the incompleteness of t he 2dF survey due to e.g. con- 
straints on fibre positioning. lEke et all l)2004al) compute for 
each galaxy a weight, Wj ^ 1, to account for the local in- 
completeness of the survey. These weights only account for 
missed galaxies which are brighter than the local magnitude 
limit, fejjim; the fiux limit is not yet accounted for. 

In order for the group membership to be well-defined 
observationally, a limiting absolute magnitude, Mi,j_com, 
must be specified. We explain in Section |S] how our choices 
for this limit are dictated by the samples used in determin- 
ing the observed correlation functions; the values we use are 
Mi,j,com - 51ogi(,ft = -19.25,-19.04,-19.50 for red, blue 
and all galaxies respectively. 

The faintest detectable absolute magnitude at distance 
X is given by M^^^ {x) = ftj.um - 25 - 51og(a;) - K'^^\x) for 
galaxies of spectral type i, where K^^ ^x) is the iiT-correctio n 
(measured for each spectral type bv IMadgwick et alJl2002ll . 
Of all the galaxies more luminous than the threshold 
Afi,j,com, the fraction which are detectable at distance x is 
given by the selection function: 



^^'\x) = 



(!>'•''> (M) dM 



Z^™" 0W(M) dM 



where Afmax ~ max(M^*-' (a;), Mtj^com)- A further complica- 
tion is that galaxies with bj < 14 were removed from the 
redshift catalogue; the lower integration limit of the numer- 
ator in equation 1141 is adjusted to account for this. 

Since we are dealing with galaxies in groups we choose 
to use luminosity functions (LFs) specific to the populations 
of these ove r-dense regions, rath er than those of the whole 
2dF sample. ICroton et al.l ||2QQ3) have derived 2dF LFs as a 
function of density environment and per spectral type. We 
use their 'cluster' LFs for red and blue galaxies to compute 
the selection function for the 2PIGGs. 

We can now assign to each galaxy a weight w'j, which 
accounts for both the local incompleteness and the luminos- 
ity and fiux limits. This weight is defined as 



(14) 
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{Mb, < Mb J, com) 

(Mb, > Mb,, com) 



(15) 



10^ 



for a galaxy of spectral type i hosted by a group at dis- 
tance X, where Mb, is the absolute magnitude of the galaxy 
in question. The galaxy membership of a group, complete 
to the absolute magnitude limit, may finally be obtained 
by summing these weights over the galaxies assigned to the 
group. 



4 HALO OCCUPATION DISTRIBUTION 

Having estimated the group membership as described in 
Section 13.31 we calculate the mean halo occupation num- 
bers as a function of group mass (Figs Q and I^J. The 
error bars on these points are purely statistical: the un- 
cert ainty in the mass estimat es is not directly accounted 
for. iBerlind fc Weinberd (|20o3) have used simulated galaxy 
group data to gauge the impact of this dispersion on the de- 
termination of {N\M}. Amongst their conclusions they find: 

(i) The exclusion of groups with fewer than four ob- 
served members is responsible for {N\M) being systemati- 
cally overestimated at low masses 

(ii) At higher masses, the measured amplitude is likely 
to be biased relative to the true value. This is due to the 
decrease in group abundance with increasing mass: of the 
groups scattered into a particular mass range, the major- 
ity originate from lower masses (and therefore have a lower 
average occupation number). 

(iii) Despite the scatter, the recovered slope at high 
group masses is not significantly biased (for sufficiently steep 
{N\M); see below). 

In the particular case of the 2PIGG catalogue, the im- 
pact of (ii) will be countered by the relatively high number 
of interlopers (these competing effects are of a roughly sim- 
ilar magnitude). Furthermore, we note that equations El and 
01 do not directly depend on the amplitude of {N\M} since 
Wgai scales identically, so the expected bias in our measure- 
ment is not a great concern in this context (but see Section 

Semi-analytic m odels (e.g. iBenson et alJ I2OO0I : 
ISheth fc Diaferid liooHl unanimously predict that {N\M) 
should tend to a power-law at high masses. The behaviour at 
low masses is expected to be somewhat more complicated, 
particularly when the occupation numbers of red and blue 
su b-populations are co n sidere d separately. Unfortunately, 
as iBerlind fc Weinberd i2002l) predict, we are unable to 
reliably probe the halo occupation distribution for masses 
less than ~ 10^*/i~^Mq. In the interests of keeping our 
results as empirical as possible, we therefore adopt the 
simplest possible model for the mean occupation number: a 
single power-law with a cut-off at low mass. Thus, 



{N\M) 



(16) 



(M/Mo)" (M ^ Mcut) 
(M < Mcut) 

where Mo and /3 are free parameters. IBerlind fc Weinberd 



Note that, after the imposition of the absolute magnitude lim- 



its, Mi 



6j .com 



our sample does in fact include groups with fewer 
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Figure 1. Total galaxy occupancy of 2PIGGs (above the min- 



imum absolute magnitude limit Mb 



Slogio h = -19.50). 



Points represent the individual groups used in the analysis. Open 
circles show the average occupancy in bins of the group mass, 
with 1-sigma statistical errors. The dashed line shows the best- 
fitting power- law (see Table □. The fitting was performed only 
over the range M > lO^'*/i~^M0 since our exclusion of groups 
with fewer than four observed members means that we expect to 
systematically overestimate the halo occupation for less massive 
groups. 



(|2002) suggest that /3 may be reliably recovered, despite the 
dispersion in the mass, for l3 > 0.7. 

Using only the data for groups with M > lO^^/i~^M0, 
we have fitted power-laws to the measured mean occupation 
numbers; the best-fitting parameters are given in Table 
and the fits are overlaid on the results in Figs and |5| In 
this high mass limit the power-law provides a good fit. We 
find /9 > 0.7 for all three samples, so these estimates of the 
high-mass slope ought not to be seriously affected by the 
scatter in the mass. 

Our results for the power-law slope at high group 
masses a re in good agree ment with previous work. For 
example, ISheth fc Diaferid I2 OOI) make simple fits to the 
semi-analvtic models of iKauffmann et al.l 1^99^ and find 
power-laws with /3rod = 0.9 and /9biue = 0.8 (but note 
that their definitions of ' red' a nd 'blue' differ from ours). 
iMagliocchetti fc Porciaiiil i2003r) use the observed 2dF cor- 
relation functions to constrain the HOD. They use the same 
cut on 77 (see Section 13.11 to define red and blue popula- 
tions, and find for their best-fitting models /3rcd = l-l^o 2 
and /3biuc ~ 0.7^g'^ in the high-mass limit. 



4.1 Second moment 

The second moment {N{N - l)\M) of P{N\M) is usually 
expressed in terms of the parameter a{M), where 



than four members. 



{N{N - 1)\M) = a{M) {N\M)^ 



(17) 
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Figure 2. As Fig.0 but showing separately the mean occupa- 
tion numbers for blue and red galaxies (for clarity, the results for 
blue galaxies have been translated up one decade, and individual 
groups are not plotted). 



Table 1. Best-fitting parameters to {N\M) = (M/Mq)'^. Errors 
are 1-sigma, marginalized over the other fitted parameter in each 
case. 





log{J\/o//i-iMQ) 


/3 


All galaxies 

Red 

Blue 


13.57l»;« 

1 3 fi^i+f 13 


Q 00+0.15 



A pure Poisson distribution has a{M) = 1, but semi- 
analytic models suggest that in fact P{N\M) has a sub- 
Poisson distribution, i.e. a(M) < 1, p a,rticularly at low halo 
masse s. iKravtsov et al.l ll2004) and ICasas-Miranda et alJ 
i2002l) discuss models for the sub-Poissonian behaviour of 
the HOD. 

We directly measure {N{N - l)\M) from the 2PIGG 
data and show the results in Fig.|3(in the form of a). Con- 
trary to expectations we find a{M) > 1 across the entire 
mass range for all galaxy types: averaged over all masses, 
a = 1.17 ± 0.06, 1.16 ± 0.06, 1.13 ± 0.05 for red, blue and 
all galaxies respectively. This is undoubtedly an effect of the 
dispersion in the mass estimates: {N{N — 1)\M) describes 
the width of P{N\M), but the large scatter in the mass 
leads to a broadening of the observed distribution and hence 
an overestimate of {N{N — 1)|M). It is especially difficult 
to obtain masses for low-mass, low-occupancy groups, and 




M/h~hlr^ 

Figure 3. Measured values of a(M) (defined in eauation ll7i . for 
blue (circles), red (squares), and all galaxies (triangles). For clar- 
ity, points for blue and red galaxies are displaced slightly either 
side of the true bin mass. 



it is therefore neither surprising nor significant that we do 
not observe the expected sub-Poissonian behaviour in this 
regime. Our measured a{M) can thus only be considered as 
an upper limit. Guided by our observations, and in the in- 
terests of simplicity we assume a{Ad) = 1 for all halo masses 
whe n calculating correla tion functions in Section|S| We note 
that f^ng et al T j2005j) . using their alternative mass deter- 
mination, do find the expected sub-Poissonian behaviour for 
low-mass groups. 



4.2 Extrapolation to low mass 

As we mention above, observational difficulties mean we are 
unable to recover the halo occupation distribution for haloes 
less massive than around 10^'*/i~^Mq. In particular, we can- 
not directly determine the minimum halo mass, Afcut, which 
may host a galaxy. However, under the assumption that the 
HOD is well described by equation 1161 the one remaining 
free parameter, Mcut, can be constrained by requiring that 
we recover the correct average space density of galaxies. 

The actual (observed) galaxy number density may be 
obtained directly from the luminosity function (integrated 
over the range of luminosity required); for this purpose we 
use the 2dF luminosity functions per type (regardless of 
environment) of Madgwick ct al. (200^. Mcut can then be 
constrained by requiring that equation |U] agrees with this 
observed value. 

The values we obtain for i\fcut through this method are 
unphysically low (in fact, for the combined and red samples 
it is not possible to match the observed number density even 
by allowing Mcut = 0). This might be attributed in part to 
th e underestimation of the amplitude of {N\M) predicted 
bv lBerfind fc Weinberd |2002), but is most likely due to the 
overly simplistic HOD we have adopted (equation I16II . It is 
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Table 2. Galaxy number densities (for galaxies brighter than the 
absolute magnitude limits derived in Sectio n 151 computed from 
the luminosity functions of lMadsrwick et al.i (2002). The quoted 
1-sigma errors are estimated from the uncertainties in the param- 
eters of the luminosity functions. 



fig (lO-^h^Mpc-S) 


All galaxies 


5.06 ± 0.49 


Red 


3.90 ±0.35 


Blue 


4.80 ±0.27 



more usual to have (N\M) tend asy mptotically to unity at 
low mass (e.g. iKravtsov et"ai]l2004) . significantly increas- 
ing the number of galaxies hosted by haloes in this mass 
range relative to the pure power-law we have adopted. We 
consider the impact of these issues on the halo model corre- 
lation functions in Section r6.2l For the purposes of Sectional 
we adopt Mcut ~ when using the simple power-law HOD 
(equation I16I I. 



5 RADIAL DISTRIBUTION OF GALAXIES IN 
GROUPS. 

We wish to obtain the group galaxy density profile given 
the group mass: pgai(r|M). Unfortunately the virial motions 
of galaxies in groups give rise to redshift distortions (e.g. 
the so-called fingers-of-god) , so that redshift information 
alone is not suitable for determining the three-dimensional 
galaxy distribution in groups. We can therefore only measure 
the surface density projected along the line-of-sight, Egai(-R) 
(where the upper-case R denotes projected distance from the 
group centre) . Only if spherical symmetry is assumed can we 
una mbiguously infer p sai(r\M) from Egai(-R). 

'Ek e et all J2004al) systematically label one member of 
each 2PIGG as the central galaxy. For each group we calcu- 
late the projected angular separation from the central galaxy 
of the remaining group members. Unfortunately, even for 
the richest groups, there are insufBcent galaxies to allow the 
radial distribution to be recovered with reasonable signal- 
to-noise for an individual group. We therefore scale the pro- 
jected separations in each group by its virial radius to allow 
like-for-like stacking of groups over a range of masses. The 
total surface density of galaxies is then calculated by sum- 
ming the weights Wj in radial bins. Summing the weights 
ensures that the profile is correctly normalized; although 
the actual value of the amplitude is of no consequence in 
the context of the halo model, it is interesting to compare 
the relative number densities of red and blue galaxies as a 
function of radius. We find that the shape of the profile is 
not altered if we use a simple number count instead of sum- 
ming the weights. Futhermore, the shape of the profile is 
also found to be insensitive to the limiting absolute mag- 
nitude used and we therefore choose to use a considerably 
fainter limit, Aftj^com — SlogjQ h — —17.5, than used to de- 
termine the group membership, so as to increase the number 
of galaxies admitted and hence improve the statistics. 

The design of the 2dF spectrograph means that fibres 
cannot be positioned within ~ 30" of one another, hence 
there is effectively an exclusion zone of this angular ra- 
dius around the central galaxy in each group. Furthermore, 
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Figure 4. Projected galaxy density profile from stacking all 
groups in our sample, scaled by their respective virial radii. The 
best-fitting NFW profile (c = 3.4) is overlaid. The decline in sur- 
face density within R < 0.05 r^i^ is due to close-pair incomplete- 
ness; the angular diameter selection described in the text ensures 
that its influence is confined to this region for all the groups in- 
cluded in this composite profile. Arrows indicate the radial range 
over which the profile fltting was performed. 



such close-pair i ncompleteness also ex i sts in the 2dF's parent 
APM catalogue: lvan den Bosch et al.l ll2005l) show that these 
have an equally important impact on the measured surface 
density at small separations. The severity of the problem 
is a decreasing function of the apparent angular size of a 
group. We have therefore imposed an additional selection 
on the sample used in this section, discarding all groups 
having a virial angular diameter less than 600". Since fibre 
collisions occur on angular scales < 30" , this confines the af- 
fected region to projected separations from the group centre 
of _R < 0.05 r-vir- This choice of cut-off optimizes the bal- 
ance between the increased statistical noise resulting from 
using fewer groups, and the improved constraints on the 
profile shape achieved by probing nearer to the centre. The 
remaining sample comprises 1,619 groups containing a total 
of 16,749 galaxies. 

The observed surface density profiles are shown in Fig. 
2]for all galaxy types, and in Fig.|K|for blue and red galaxies 
separately. The decline in surface density due to close-pair 
incompleteness in the region R < 0.05 r^h is very appar- 
ent. We note also that the surface density in not uniformly 
zero outside r^ir, as it ideally ought to be since the group- 
finding algorithm only admits galaxies which are deemed to 
be bound to a group (i.e. they lie within that group's virial 
radius). This is due mainly to the scatter in the mass esti- 
mates, which translates into a scatter in r^h so that there 
is uncertainty in the radial scaling of the groups. We assess 
the impact of this effect in Appendix IA2I 
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Figure 5. As Fig. |3 but showing separately the results for red 
(circles) and blue (squares) galaxies. Red galaxies have a more 
concentrated profile than blue galaxies, and dominate the number 
counts towards the group centre. 



5.1 Profile fitting 

Obtaining an analytic fit to the observed profiles will greatly 
facilitate their use in the halo model. Simulations suggest 
that dark matter halos obey a universa l density profile of 
the form jNavarro. Frenk fc Whit^ll997^ 



p{r) = 



r/vs (1 + r/rs) 



(18) 



where rg is a characteristic scale radius, and Ps sets the 
amplitude of the profile. This NFW profile is often alter- 
natively expressed in terms of the concentration parame- 
ter, c = rvir/fg. While there is no physical prerequisite that 
galaxies should follow the sa me profile as the dark matter , 
similar previous studies (e.g. iLin. Mohr fc Stanford! l2004l) 
have found the NFW profile to provide a good fit to the ra- 
dial distribution of galaxies, and it therefore constitues the 
most promising initial guess for the profile. 

In order to attempt fitting of the NFW profile to our 
observations we require its plane projection. 



E(i?) = 2 



p(r) 



r dr 



i?2 



(19) 



In the case r-max = oo, eauation ll9l has an analytic solution 
(e.g. Bartolmarm 1996), but we require rmax = r^ir since we 
assume the profile to be truncated at the virial radius. We 
therefore derive the projected surface density profile numer- 
ically from eauation ll9l with the integration region bounded 

at the virial radius^ 

Jjag ai & Kr avtsovl (|200^ discuss the problem of degen- 
eracy between the concentration parameter and amplitude, 
Ps, of the NFW profile. They suggest that the amplitude be 
re-expressed in terms of the total number of galaxies con- 
tained within the virial radius: 



Table 3. Best-fitting NFW concentration parameters. Errors are 
1-sigma, marginalized over the probability distribution of the 
other fitted parameter. A^vir represents the average number of 
galaxies within Tvir per group (above the minimum absolute mag- 



nitude Mh 



51ogio/i = -17.5). 





A'vir 




All galaxies 


39.8 ±0.8 


2.4 ±0.2 


Red 


21.7 ±0.6 


3.9 ±0.5 


Blue 


18.2 ±0.5 


1.3 ±0.2 



iVvir = 



p{r) 47rr^dr = A-Kpsrl,^^^ 



(20) 



where g{c) = ln(l + c) — c /( 1 + c) . 'Na gai fc Kravtsovl Jic 
were interested in fitting to simulated data for which A^vir 
could be trivially obtained, reducing the problem to a one- 
parameter fit. The peculiar velocity distortions in the 2dF 
redshifts make it impossible to determine A^vir reliably for 
the 2PIGGs, hence we are forced to allow the amplitude of 
the profile to be a second free parameter. We prefer to use 
this form none the less, since c is far less degenerate with 
Ai'vir than it is with ps, and we find A'^vir a more intuitive 
parametrization of the amplitude. 

As discussed above, the reliability of the galaxy surface 
density measurements deteriorates towards the centre of the 
groups. We therefore restrict the profile fitting to points in 
the range 0.05 < R/r^ir < 1.0. We find projected NFW 
profiles are capable of describing the observed profiles very 
well over the radial range fitted, both for the separate blue 
and red galaxy profiles and the combined galaxy profile. The 
best-fitting NFW profiles are overlaid on the data in Figs|l] 
and 1^1 and the concentration parameters are given in Ta- 
ble |3 Fig. |S| shows the joint probability distribution for the 
NFW concentration parameter and amplitude fitted to the 
data for all galaxies; we find there to be no strong degener- 
acy between the two parameters. 

It is a non-trivial assumption that stacking projected 
NFW profiles with a range of concentration parameters 
should result in a composite profile that is still NFW in 
shape, and which has a concentration parameter indicative 
of the 'average' of the underlying sample. We therefore test 
this assumption in Appendix^ For a lognormal distribu- 
tion of c we find that the composite profile retains the NFW 
shape, but it has a concentration parameter which is lower 
than the median c of the underlying sample. The severity of 
the effect increases as the distribution of c becomes wider. 

The best-fitting concentration parameters we give in 
Table El are broadly consistent with the analyses of galaxy 
clusters in the ■ ft'-band of the 2 Micron All-Sky Survey by 
iLin et alJ I2OO?) who obtain c = 2.90to?i, and of 14 CNOC 
clusters by Carlborg ct al. ( 1997) and van de r Marcl ct ^l 
^2000) who find c = 3.7 and 4.2 respectively. |H ansen et alJ 
(2004) find c = 1-3 for photometrically identified clusters in 
the SDSS. 

We caution that the close-pair incompleteness effects 
described above mean that we cannot reliably constrain the 
radial distribution for R < O.OSrvir, so it is impossible, for 
example, to determine whether the profile is cuspy or flat- 
cored at its centre. Furthermore, a number of effects, some of 
which we have already mentioned, are expected to cause the 
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Figure 6. Contours of for the concentration parameter and 
amplitude of the NFW profile fitted to the projected galaxy den- 
sity profile for all galaxies. Marginalized percentage confidence 
levels are indicated. 
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Figure 7. Variation of group concentration parameter with 
mass, for red (squares), blue (circles), and all galaxies (triangles). 
For comparison, the dashed line shows the trend in the mean con- 
ce ntration param eter of dark matter haloes in simulations (due 
to lBullock et alfc oOlV 



measured concentration parameters to be depressed relative 
to the true value: 

(i) stacking groups (Appendix IXll. 

(ii) the uncertainty in the radial scaling of groups (Ap- 
pendix 

(iii) if we are still affected by close-pair incompleteness 
despite the angular diameter selection we have used, and 

(iv) if there are discrepancies between the position of the 
'central' galaxy and the 'true' group centre. 



5.2 Mass dependence 

The mean concentration parameter of dark matter halos in 
simulations is found to be a decreasing function of mass, 
typically modelled as a power law, c = cp (M /M«)~'^ with 
A ~ 0.1 and cq ~ 10 (e.g. iBuUock et alJl200j) . 

Fig. [7| shows the measured concentration parameter 
against group mass for red and blue galaxies separately, 
and for the combined sample. These values were obtained 
by stacking groups within mass bins, and fitting NFW pro- 
files as above. The results suggest a detection of a slight 
decrease in c with mass, but the trend is barely significant. 
When calculating halo model-based correlation functions in 
the following section, we therefore choose to incorporate no 
mass-dependence in the radial galaxy distribution; we con- 
firm the safety of this assumption belo w in Section 16.21 

The comparison with the results of lBullock et al J (I2OOII) 
reveals that, even for the red galaxy distribution, we find 
the galaxy radial distributions to be significantly less con- 
centrated than those of average dark matter haloes in sim- 
ulations (but beware the caveats at the end of Section [5. 111 . 
The suggestion of a negative correlation of concentration 
with group mass is in qualitative agreement with the re- 



sults of iHansen et alJ (|2004h , who find c to be a decreasing 
function of galaxy occupation number. 



6 PREDICTING CORRELATION FUNCTIONS 

Armed with the observations of the previous sections, we 
are now equipped to calculate the halo model pr edic- 
tions for the correlation function. 'Haw kins et alJ ll2003t) and 
[Madgwi ck et al. ( 2003i) have measured the two-point corre- 
lation functions for the full 2dFGRS sample and for red and 
blue subsamples. Care must be taken to ensure a fair, like- 
for-like comparison, since the correlation func tion is known 
to depend on redshift and luminosity (e.g. IZehavi et alJ 
120021: iNorberg et al. 2002a '). For this reason, the observed 
correlation functions are labelled by their effective luminos- 
ity and effective redshift. These effective quantities are pair- 
weighted measures, since the correlation function is based on 
counting pairs of galaxies; hence, for example, the effective 
redshifts - Zs — 0.15 for the all-galaxy sample, and Zs — 0.11 
for both the red and blue galaxy samples - are somewhat 
higher than the median redshifts of the samples. We use 
these redshifts when calculating the halo model correlation 
functions for the respective populations via the formalism 
of Section |5| 

In the case of the luminosity, the situation is compli- 
cated by the fact that the 2dF samples are flux-limited, 
whereas the halo model formalism effectively assumes 
volume-limiting. For a volume-limited sample the effective 
luminosity is identical to the mean luminosity. The minimum 
luminosity (required by eauation ll4ll is therefore determined 
by solving 




Figure 8. Projected two-point correlation function for galax- 
ies of all types. Data poi nts rep resent the observed correlation 
function of Hawkins ct al] i2003l) and the dot-dashed line is their 
power-law fit to these points. The solid line is our halo model- 
based prediction, and the short-dashed lines show the contribu- 
tions from the individual one- and two-halo terms in the model. 
The lower panel shows the same data normalized by the power- 
law fit to the observations. 



(21) 



for Z/min- The e ffectiv e lu minosities of the samples used by 
iHawkins et"all 1)200.^^ and iMadgwick et al.l (|200,?h are, us- 
ing M* — Slogj^p/i = —19.66: all galaxies, Lg = 1.4L«; 
red, Lb — 1.261/,; blue, Ls — 0.951/,. The resulting 
limiting absolute magnitudes are Mtj^com — Slog^g/i — 
— 19.25, —19.04, —19.50 for red, blue and all galaxies respec- 
tively. These limits have been used in determining the halo 
occupation distribution of the 2PIGG groups (Section llj, 
and the number densities of the galaxy populations (Table 

In order to negate the effect of redshift distortions in the 
observed correlation functions we use the radial projection. 



H(a) 



^(r) r dr 



(r2 



2)1/2 



(22) 



In Figs Isl l9l and Hill we compare the halo model predictions 
with the observed projected correlation functions. 

In producing these halo model correlation functions 
we have simply used the best-fitting halo profiles and oc- 
cupation distributions, deliberately making no attempt to 
fit to the observed correlation function data. We find that 
the model successfully recovers the approximate shape and 
amplitude of the observed correlation functions across the 



Figure 9. As Fig. |^ but for blue galaxies only, and using the 
observed correlation function data of lMadgwick et alj i2003tl . 




Figure 10. As Fig. |5] but for red galaxies only. 
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Table 4. Galaxy bias parameters predicted by the halo model. 
Note that the effective luminosities and rcdshifts differ between 
the three galaxy samples. 





^gal(-^'s, ^s) 




All galaxies 


1.05 


0.92 


Red 


1.23 


1.12 


Blue 


0.85 


0.81 



range of distance scales considered. In particular, the model 
correlation functions trace the obvious divergence from the 
power-law form at large scales. The different observed HODs 
and halo profiles of red and blue galaxies derived in Sections 
|3]and[n]are seen to give rise to the expected relative strengths 
of correlations on small scales, although the halo model pre- 
dictions do show significant deviations from the power-law 
form on these scales. 



6.1 Galaxy bias on large scales 

On large scales the differences in the correlation functions 
of the different galaxy classes are encapsulated in the galaxy 
bias parameter (equation IIUI . To allow comparison with 
other results, we correct the galaxy bias parameter for red- 
shift and luminosity dependence via 



fegal(I/, 0) = fogal(-£', Zs)D{Zs) 

and 

6gai(L, 0)/6gai(L., 0) = 0.85 + 0.15(L/L. 



(23) 



(24) 



The first relation follows from the assumption that galaxy 
clustering evolves weakly over this redshift range (i.e. fg^gai 
remains approximately constant ). The second i s found from 
correlation function analysis bv iNorberg e t al. ("2001"). The 
predic ted galaxy bias factors are in Table 2] Laha v ot"all 
(|2002) have measured the galaxy bias factor by combining 
the 2dFGRS galaxy power spectrum with the post-WMAP 
CMB data and found 



6gai(-L,,0) ^ 0.96 ±0.08, 



(25) 



in good agreement with our predicted value for all galaxies: 
6gai(L.,0) = 0.92. 

Furthermore, the relative bias between our red and blue 
samples, bred/febiue = 1.4, i s in excellent agreement with that 
found bv lCole et alj (|200^, who measure the power spectra 
for red and blue 2dFGRS subsamples defined by a colour 
cut at bj — rp = 1.07. 



6.2 Sensitivity to parameters 

The halo bias factor, h{M), (equation |HJ is a monotonically 
increasing function of mass, with the lowest mass haloes in 
fact being anti-biased. The galaxy bias factor predicted by 
the halo model (eauation llH will therefore be controlled by 
the relative occupation of high to low mass groups: thus, 
since the red galaxies have a steeper {A'^|Af) than the blue, 
they also have a greater large-scale bias factor. 

The differences on small scales are, in principle, due to 
the different radial profiles as well as the relative halo oc- 
cupation numbers of the galaxy classes. However, it turns 



out that on the scales we are currently able to access obser- 
vationally (and given the observational results of the previ- 
ous sections) the correlation function is much more sensitive 
to t he differences in the halo occupation distribution (see 
alsolSheth et alJbOOlt iBerUnd fc WeinberdEooi: IScrantonI 
l2003h . For example, substi tuting the dark matte r concentra- 
tion parameter relation of iBuUock et alJ (1200 ll . see Section 
15.21 . in place of the observed values has a perceptible effect 
only on scales r < 3 x 10~^Mpc//i. 

As we explain in Section 14.21 the HOD for low mass 
groups is unconstrained by the observations. In consequence, 
we are forced to assume an extrapolation of the high mass 
HOD to low masses. The shape of the intra-halo term of 
the correlation function is strongly dependent on the par- 
ticular form assumed for the halo occupation distribution 
at low group masses, but the overall amplitude is robust 
to such details, being dominated by the contribution from 
galaxy pairs in high-mass haloes. iBerhnd fc Weinberell20o3 
demonstrate the contributions to ^(r) from galaxy pairs as a 
function of halo mass: in general lower mass haloes become 
infiuential on smaller scales. By using a more sophisticated 
extrapolation to low masses it is possible to produce a halo 
model correlation function which more faithfully reproduces 
the power-law form on small scales. For example, we show 
in Fig. Illl the correlation function for all galaxies produced 
by using a halo occupation distribution of the form 



{N\M) 




(M ^ Mo) 

(Mcut ^ M < Mo) 

(M < Mcut) 



(26) 



with Mo and l3 as given in Table Q and introducing an ad 
hoc sub-Poisson distribution at low masses: 



a(M) 



1 (M > Mo) 
0.2 (M < Mo) 



(27) 



As described in Section 14.21 we constrain Mcut by match- 
ing the observed number density. For this choice of HOD 
we obtain a much more realistic Afcut = 1.4 x 10^'^/i~^Mq. 
Both this and our simplest possible HOD (equation 1161 are 
entirely consistent with our observations, and are success- 
ful at recovering the amplitude of the correlation function. 
However, the more sophisticated form gives a much better 
fit to the observations on small scales. 



7 DISCUSSION 

Motivated by the halo model, we have used the 2dF 
Percolation-Inferred Galaxy Group catalague to make di- 
rect observational estimates of the distribution functions of 
galaxies within and amongst haloes. We have found a pro- 
jected NFW profile provides a good fit to the composite pro- 
jected surface density of galaxies in red and blue subclasses, 
and for the combined sample. The radial distribution of the 
red galaxies was found to be the most concentrated towards 
the group centre, but, even for these, the concentration is 
significantly lower than for dark matter haloes measured in 
simulations. 

We have shown that at high masses the mean occupa- 
tion number of the 2PIGGs with red, blue, and all galaxies 
can be described by a power-law in the mass, with the num- 
ber of red galaxies increasing most steeply with the mass. 
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Figure 11. As Fig.|^ but the halo model predictions are based 
on the more sophisticated HOD described by equations 1261 and 
I2YI This form of the HOD better reproduces the power-law on 
small scales, while still being consistent with the observational 
constraints of Section HI 



The difficulty of obtaining reliable dynamical mass estimates 
for groups with low occupation numbers made it difficult to 
constrain the halo occupation distribution in this regime. 

These observations have allowed us to test the halo 
model against fully self-consistent observations: the same 
2dFGRS galaxy populations used to compile the 2PIGG 
catalogue have previously had their correlation functions 
directly determined. Despite the uncertainty regarding the 
form of the halo occupation distribution at low halo masses, 
the halo model proves successful at reproducing the near 
power-law form of the galaxy correlation functions. Most 
impressively, the model also succesfuUy recovers the relative 
biasi ng of the red and blue subpopulations. In agreement 
with ISheth et alJ i200ll) . we identify the halo occupation 
number, {N\M), as the dominant factor contributing to the 
differences in the correlation functions of red and blue galax- 
ies: the HOD for red galaxies gives a greater weighting to 
high-mass haloes leading to a stronger clustering signal on 
all scales. The differences in the radial distributions of the 
galaxies in groups are found to be relatively unimportant 
on the distance scales we are presently able to access obser- 
vationally, but do begin to exert influence at the smallest 
scales considered. 

There are of course any number of forms which could be 
chosen for th e extrapolation of the HOD to low mass (see 
for exarnple. iKravtsov et alj I200I ISheth fc Diaferid liooH: 
IScrantod 120031: iMaeliocchetti fc Porcianil bOO^T We have 
shown that the power-law form of the observed correlation 



functions does place constraints on this extrapolation. How- 
ever, in the intra-halo term there is certainly degeneracy be- 
tween a{M) and {N\M), so it is unlikely that the two-point 
correlation can be used to constrain a unique solution. Direct 
measurement of the HOD for low-mass haloes requires much 
improved mass estimates, and represents a difficult observa- 
tional challenge. Higher order clustering statistics represent 
a more stringent test for the halo model, and offer the possi- 
bility of indirectly constraining the low-mass HOD in a more 
model-independent fashi on than is possible from the two- 
point function alone fe.g.'Ma fc_jYj200(]|JScoccimarr c) et alJ 
I2OOI: Wang ct al. 2004: Fosalba. Pan fc Szapudi 20oK 
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APPENDIX A: STACKING GROUPS 

In order to allow accurate measurement of the galaxy radial 
distribution it is necessary to combine many groups so as 
to improve the statistics. Even the largest groups in the 
2PIGG catalogue do not have sufficient galaxy members to 
allow the concentration parameter to be well constrained for 
individual groups. 

The groups are scaled by their virial radius. However, 



Table Al. Results from fitting NFW profiles to stacked profiles 
having concentration parameters drawn from lognormal distribu- 
tions with the given median (c) and width (o"c)- The concentration 
parameter of the best-fitting NFW profile, cgt, is found to be a 
fair indicator for the mode of the underlying distribution. 
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NFW profiles with different concentration parameters are 
not self-similar when scaled in this way (scaling by rs, on 
the other hand, does produce self-similar profiles, but this 
approach is not practical since estimating rs is non-trivial). 
It is not clear therefore that the resulting profile will retain 
the NFW form, or that the concentration parameter thus 
obtained will be at all indicative of the 'average' concentra- 
tion of the groups in the sample. 

We do not have any indication of the expected distri- 
bution of the concentration parameter for galaxy groups, 
due to the difficulty in measuring the profile for individual 
groups. For dark m atter haloes, simulations (e.g. 'jin^'2000|; 
iBullock et all200lh suggest that c has a lognormal distribu- 
tion at fixed halo mass. 



p{c\M)dc = 



27ro", 



■ exp 



(In c — In c) 
2^1 



d In c 



(Al) 



where the median, c, contains the only dependence on halo 
mass (see EU- Both Jing (2000) and Bullock et al. (20qJ) 
find (Tc ~ 0.2. Although there is no reason to assume that 
these results apply equally well to the galaxy group profiles, 
we use this distribution in order to illustrate the effect of 
stacking profiles with a wide range of concentrations. 

Stacking profiles drawn from a distribution p(c) will 
produce the aggregate profile, 



T,{R) = / E(i?|c) p(c) dc. 



(A2) 



We have computed this aggregate profile for a range of log- 
normal distributions, and fitted a projected NFW profile to 
the result, just as for the real data (i.e. using only the region 
0.05 < R/r^ir < l.O). 

The stacked profiles can be well fitted by the NFW pro- 
file, even for the widest distributions. However, the concen- 
tration parameter of the best-fitting profile is systematically 
lower than the median of the underlying distribution. Table 
lAll shows the results for the range of distributions consid- 
ered. 

The disparity between the median and fitted c becomes 
increasingly significant as the distribution broadens. How- 
ever, the best-fitting c is found to be a fair indicator for 
the mode (i.e. peak position) of the underlying lognormal 
distribution, particularly when the dispersion is relatively 
low. 
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Figure Al. The crosses show the simulted aggregate profile 
obtained by stacking NFW profiles drawn from a lognormal dis- 
tribution with median c = 5 and width ctc = 0.2. The dotted 
line is the NFW profile which best fits the stacked points; it has 
c = 4.66. 

Al Mass dependence 

iBullock et al.l ll200ll) find the median concentration param- 
eter for dark matter haloes to be a function of halo mass: 
c(M) « 9(M/M*)"° " at 2 = 0. In combination with the 
mass function, n{M) (equation]^, this allows us to write 
down the global distribution of c as 

/(c) = J p{c\M) n{M) AM. (A3) 

Figure IA2I shows this global distribution for haloes in 
the mass range 10^^ < M/H'^Mq < 10^*^, assuming p(c|M) 
is lognormal with = 0.2. The combination with the mass 
function leads to a slight widening of the distribution, par- 
ticularly on the low-c tail. 

This result clearly depends on the assumption of forms 
for c(M) and p{c\M), both of which we have derived from 
simulated dark matter haloes. Our observational results 
(Fig. |7|l suggest that Cgai depends on the host halo mass 
no more strongly than does the dark matter concentration 
parameter. 

What is most important is that the combination with 
the mass function has not significantly altered the shape of 
the distribution, so that the results of Section ^ still hold 
when we stack groups covering a wide range of masses, albeit 
with a slightly wider underlying distribution of c. 

A2 Scaling uncertainty 

The dynamical masses determined bv lEke et al.l (|20o3) for 
the 2PIGG groups have a large scatter, particular l y for sm all 
(low membership) groups. Figure 3 of lEke et alJ l^2004b^ il- 
lustrates the dependence of this scatter on group member- 
ship. Since low-membership groups are the most abundant 
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Figure A2. The solid curve is the global distribution of c for 
dark matter haloes (eauation lA3l . For comparison, the dotted line 
shows a lognormal distribution with width = 0.2 and having 
the same mode as the global distribution. 

in the 2PIGG catalogue, one might expect the scaling error 
to be dominated by their contribution. Fortunately however, 
the impact on the stacked profile of groups with a particular 
membership. A'', is proportional to Nxn{N). Thus, although 
there are many more low-membership groups, the individ- 
ual groups have proportionally less impact on the stacked 
profile than a single high-membership group. It turns out 
that N X n{N) is nearly independent of N for the 2PIGG 
catalogue, so the overall scaling error receives roughly equal 
contributions from all group memberships. 

We adopt A = 10 as our fiducial point, since the er- 
ror distribution here is close to average. The distribution of 
M/MtrvLc at A = 10 is roughly lognormal with median 1.26 
and o ~ 1.0. Propagating the error distribution of M/Aftruc 
through eauation ll3l implies that r /rtruc has a lognormal dis- 
tribution with median 1.08 and g ~ 0.33: the rvir oc M^^^ 
relation means the large scatter in the mass estimates is 
considerably suppressed, leading to a somewhat narrower 
distribution of r /rtruc (Fig. IA3l . 

The aggregate profile obtained by stacking profiles with 
an uncertain radial scaling is given by 

E(i?) = y" E (I 1 c) p{8)ds, (A4) 

where we have defined the scaling error, s = r /rtruc, and 
its probability distribution p{a) which, as we have discussed 
above, may be approximated by a lognormal distribution. 
This profile is illustrated in Fig. IA4I 

The scaling uncertainty has the effect of smoothing the 
cut-off at the virial radius, just as observed in Section|3 and 
causes a significant reduction in the measured concentration 
parameter. 
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Figure A3. The dotted line is the error distribution in the group 
mass estimates (derived from eauation ll2i for groups having N = 
10 observed galaxy members. Through equation IlijI this implies 
an error distribution in the virial radius, shown by the solid line. 
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Figure A4. Effect of scaling uncertainty on the aggregate profile 
obtained by stacking groups. The solid line shows a projected 
NFW profile with c = 5.0. Crosses show simulated data generated 
by eauation lA4l assuming the scaling uncertainty described in the 
text. The dotted line is the NFW profile which best-fits the data; 
it has c = 4.13. 



